DE-net: Dynamic Text-Guided Image Editing Adversarial Networks
نویسندگان
چکیده
Text-guided image editing models have shown remarkable results. However, there remain two problems. First, they employ fixed manipulation modules for various requirements (e.g., color changing, texture content adding and removing), which results in over-editing or insufficient editing. Second, do not clearly distinguish between text-required text-irrelevant parts, leads to inaccurate To solve these limitations, we propose: (i) a Dynamic Editing Block (DEBlock) that composes different dynamically requirements. (ii) Composition Predictor (Comp-Pred), predicts the composition weights DEBlock according inference on target texts source images. (iii) text-adaptive Convolution (DCBlock) queries features parts parts. Extensive experiments demonstrate our DE-Net achieves excellent performance manipulates images more correctly accurately.
منابع مشابه
Improvement of generative adversarial networks for automatic text-to-image generation
This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...
متن کاملNeural Photo Editing with Introspective Adversarial Networks
The increasingly photorealistic sample quality of generative image models suggests their feasibility in applications beyond image generation. We present the Neural Photo Editor, an interface that leverages the power of generative neural networks to make large, semantically coherent changes to existing images. To tackle the challenge of achieving accurate reconstructions without loss of feature ...
متن کاملGenerative Adversarial Text to Image Synthesis
Automatic synthesis of realistic images from text would be interesting and useful, but current AI systems are still far from this goal. However, in recent years generic and powerful recurrent neural network architectures have been developed to learn discriminative text feature representations. Meanwhile, deep convolutional generative adversarial networks (GANs) have begun to generate highly com...
متن کاملText-image Coupling for Editing Literary Sources
Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and browsing of complex literary hypermedia including original manuscript documents and other handwritten sources. Editing capabilities allow the user to transcribe manuscript images in an interactive way and to encode the re...
متن کاملSpectral Image Visualization Using Generative Adversarial Networks
Spectral images captured by satellites and radiotelescopes are analyzed to obtain information about geological compositions distributions, distant asters as well as undersea terrain. Spectral images usually contain tens to hundreds of continuous narrow spectral bands and are widely used in various fields. But the vast majority of those image signals are beyond the visible range, which calls for...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i8.26189